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1» Introduction 


Diptin Technical Specification 


This document is the technical specif ication of DIF, the data 
interchange format developed by Software Arts, Inc. It is a 
reference document and not a tutorial. It includes a description 
of the DIF file Organization and structure, required items, and 
optional Standard items. It also explains the use of the optional 
Standard items by specific applications. Appendix A is an example 
of a DIF data file. 

Programs should use defined Standard items when possible. The DIF 
Clearinghouse will update this document to describe new items as 
they are defined and record their use in specific programs. 
Programmers developing new Software that incorporates new 
optional items should inform the Clearinghouse fully about them 
so that they can be standardized for common use by any program 
supporting the DIF format. 

Programmers should remember that the program reading the data can 
be extremely simple. The program writing the data must handle it 
in such a way that it can be read by any program supporting DIF. 

Within this text, upper case characters are actual values to be 
entered as shown and lower case characters name the value to be 
entered to a field. It is assumed that the ASCII character set is 
being used. See the section on Definitions for a discussion of 
character sets. 


2. Constraints of the Format 


The DIF format was designed for ease of use, and ? for the sake of 
simplicity, certain constraints have been imposed on the format. 
Because DIF is not intended to be a universal representation for 
all data, one of these constraints is the representation of data 
in tables with rows of equal length and columns of equal length. 
A second constraint is that, because many users program in BASIC, 
the files must be compatible with BASIC programs. Programs 
written in another language, such as Pascal, can use a set of 
subroutines to read and write DIF files. 

Below is a list of specific constraints on a DIF file. 


1. Because some BASICs have only primitive facilities for 
reading and writing strings, the Convention of keeping 
numbers and strings on separate lines has been 
adopted. 

2. Two items, VECTORS and TUPLES, are required to Support 
Systems that require preallocation of space. 
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3. Because some Systems do not allow programs to test for 
the end of a file, a special data value, EOD, provides 
graceful termination to a program. 

4. To simplify programming, there are only two formats 
within the file, and all fields are predefined as 
character strings or numbers. 

5. Strings must be enclosed in quotes if they contain 
characters other than alphanumerics. 

6. The character set is restricted to the printable ASCII 
characters• 

7. Although DIF places no explicit restriction on the 
length of data strings, some Systems may impose 
restrictions. 

Since the DIF format is not meant to meet all the needs for data 
representation, it may be necessary to use multiple DIF files or 
additional formats for some applications. A word processor, for 
example, would not use a DIF file to störe text but could use DIF 
files for tables of values within a report. 


3. Organization of the DIF Data File 

A DIF file is a text file using the Standard printable character 
set of the host machine. The model for the data is a table. 
Fields are called vectors; records are called tuples. Data is 
organized into vectors of equal length. Each tuple consists of a 
row of corresponding values read across each vector. The user 
determines the specific groupings of vectors and tuples. Often 
vectors are treated as columns and tuples are treated as rows, 
but because DIF can transpose columns and rows, the terms vectors 
and tuples are used instead of the terms columns and rows. 

The DIF file consists of two sections, a header section and a 
data section. The header section contains descriptions of the 
file and the data section contains the actual values. 


4. The Header Section 

The header section is composed of header items. There are four 
Standard required header items and several Standard optional 
header items. 
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4.1 The Header Item 

The header items describe the data Organization. Each header item 
consists of four fields arranged on three lines as illustrated 
below. The first line is a token, the second line consists of 
two numbers, and the third line contains a String. 


Topic 

Vector Number, Numeric Value 
"String Value" 


4.1.1 The Topic 

The first line of the header item is the Topic. It identifies the 
header item, and must be a token. 


4.1.2 The Vector Number 

The first field on the second line is the Vector Number. If the 
header item describes a specific vector, the Vector Number 
specifies the vector being described. If the header item 
describes the entire file and not one specific vector, the Vector 
Number is zero (0). 


4.1.3 The Numeric Value 

The Value is an integer and occupies the second field of the 
second line, separated by a comma from the Vector Number. If the 
header item does not use a numeric value, the Value is zero (0). 


4.1.4 The String Value 

The String Value occupies the third line of the header item. The 
String Value is always enclosed in quotation marks. If it is not 
used, the line consists of a null String, a pair of quotations 
marks with no space between them. 


1 A token is an upper case String of alphanumeric characters. It 
is usually short, 32 characters or less. See the Definitions 
section for more Information. 
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There are four required header items. The other header items 
described in this document are Standard optional header items. 
The defined Standard items should be used by new programs using 
DIF. If it is absolutely necessary, a new header item may be 
defined to meet the needs of a particular program. For details, 
see the section on Defining New Header Items. 

A program may ignore all header items until it finds the header 
item DATA, described below. 


The following four header items are required; 


4.2.1 The First Header Item 

TABLE 
0,version 
"title" 


The header item TABLE must be the first entry in the file. It 
identifies the file as a DIF file. The version number must be 1. 
The "title“ is the title of the table and describes the data. 


4.2.2 Vector Count 


VECTORS 
0,count 

n n 


The header item VECTORS specifies the number of vectors in the 
file. 

Note; This header item must appear before header items that refer 
to vector numbers. Otherwise, it can appear anywhere within the 
header section. 


4.2.3 Tuple Count 


TUPLES 
0,count 


The header item TUPLES specifies the length of each vector (the 
number of tuples) . This can be used by a program to preallocate 
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storage space for the data. This item may appear anywhere within 
the header section. 

Note: Programs reading the data assume that the tuple count is 
correct. Some programs may be able to generate this Information 
only after all data has been generated. These programs must 
reread the DIF file to count the tuples, and rewrite the TUPLES 
item with the correct count. 


4.2.4 The Last Header Item 

DATA 
0,0 
ii ii 


The header item DATA must be the last header item. It teils the 
program that all remaining data in the file are data values. 

The following header items are optional. The programs 
that are known to use them are noted with the item. For 
detailed information on each program's specific use of 
the item, see the section below on Applications Programs. 


4.2.5 Vector Label 
LABEL 

vector#,line# 

"label" 

The header item LABEL provides a label for the specified vector. 
The line number provides an Option for labels that span more than 
one line, and can be ignored by a System that allows single line 
labels only. The values 0 and 1 are equivalent line numbers. 

Note: Some programs do not use the LABEL field. If the first 
vector in a tuple contains String values, the first data value in 
the tuple may be treated as a label. 

Used by the VisiPlot tm and VisiTrend/VisiPlot tm programs. 


4.2.6 Vector Comment 

COMMENT 
vector#,line# 
"comment" 
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The header item COMMENT is similar to LABEL. It provides an 
Option to Systems that allow an expandecl description of a vector 
in addition to a label. 

Used by the VisiPlot and VisiTrend/VisiPlot programs. 


4.2.7 Field Size 


SIZE 

vector, bytes 


The header item SIZE provides to programs such as data base 
Systems the Option to allocate fixed size fields for each value. 

Because SIZE is an optional item, programs using SIZE must be 
able to read files produced by programs unable to generate SIZE 
information. 

Used by the CCA/DMS program. 


4.2.8 Periodicity 

PERIODICITY 
vector#,period 


The header item PERIODICITY provides the Option of specifying a 
period in a time series. 

Used by the VisiPlot and VisiTrend/VisiPlot programs. 


4.2.9 Major Start 

MAJORSTART 
vector #,Start 

II If 


The header item MAJORSTART specifies the first year of a time 
series. 

Used by the VisiPlot and VisiTrend/VisiPlot programs. 
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MINORSTART 
vector#,Start 

h n 


The header item MINORSTART specifies the first period of a time 
series. 

üsed by the VisiPlot and Vis iTrend/Vis iPlot prograrns» 


4.2.11 True Length 

TRÜELENGTH 
vec tor #,leng th 

ii H 


The header item TRÜELENGTH specifies the portion of a vector that 
contains significant values. 

üsed by the VisiPlot and VisiTrend/VisiPlot prograrns. 


4.2.12 Units 

UNITS 
vector#,0 
"name" 

The header item UNITS specifies the unit of measure for the 
values in the given vector. Name is the unit, for example meters 
or ft . 

Used by TKlSolver. 


4.2.13 Display units 

DISPLAYUNITS 
vector#,0 
"Name" 


The header item DISPLAYUNITS specifies the unit in which the 
values in the given vector should be displayed. This unit may be 
different from the one in the UNITS field. The values in the 
given vector are always stored in the unit specified in the UNITS 
field, and the application program is responsible for making the 
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value conversion between the UNITS and DISPLAYUNITS„ 

For example, a vector might be stored in km, but displayed in the 
program in miles . The UNITS field would'be km, the DISPLAYUNITS 
field would be miles , and the values in the vector would be in 
km , Any program using the vector would have to define the 
conversion between km and miles to display the values in miles . 

Used in the TKlSolver program. 


4.3 Defining New Header Items 

If there is no Standard optional header item to fulfill the 
specific need of a Subsystem, a new header item may be defined. 
Because the DIF format is intended for common use, new optional 
header items should be standardized through the DIF 
Clear inghouse. They will then be added to this document. 

To be accepted as Standard items, new optional items must be 
consistent with existing conventions. 

An optional item extends the format for a specific application. 
Any program reading the DIF file should be able to operate 
without optional items. If a reading program requires the 
Information provided by an optional item, it should prompt the 
user to supply the missing Information and not require the item 
itself. 


5. The Data Section 


The data section consists of a series of tuples. The Data Values 
within the tuples are organized in vector sequence. 

Each Data Value represents one element of data in the file. The 
data may be either the actual data or one of the two Special Data 
Values that mark the beginning of a tuple (BOT) and the end of 
data (EOD) in the file. 

Each Data Value consists of two lines. The first line consists of 
two fields containing numeric values, and the second line 
consists of one field containing a String value. The format is: 

Type Indicator, Number Value 
String Value 


Page 8 



DIF tm Technical Specification 


5.1 The Type Indicator Field 

The Type Indicator is an integer that teils the program what kind 
of data is represented by this value. There are currently three 
possible values. 

-1 The data is a Special Data Value, indicating either the 
beginning of a tuple or the end of data. The Number Value is 
zero (0) and the String Value is either BOT or EOD. See the 
description below of Special Data Values. 

0 The data is numeric. The Number Value field contains the 
actual value and the String Value field contains a Value 
Indicator. See the descriptions below of the Number Value and 
String Value fields. 

1 The data is a string value. The Number Value is zero (0) and 
the String Value field contains the actual string value. 


5.2 The Number Value Field 

When the Type Indicator is 0, the Number Value field contains the 
actual value. The value must be a decimal (base 10) number. It 
may be preceded by a sign (+ or -) and it may have a decimal 
point. It may be preceded or followed by one or more blanks. If 
the data value contains an exponent of a power of ten, the value 
is followed by the letter E and the signed or unsigned exponent 
power of ten. 

Note: This is the only place where DIF allows a non-integer 
value. Some programs accept only integer values. 


5.3 The String Value Field 

The contents of the String Value field are dependent on the Type 
Indicator. 


5.3.1 Special Data Value 

If the Type Indicator is -1, the String Value is one of the two 
Special Data Values, BOT or EOD, and the Number Value is 0. 

Each tuple begins with the Special Data Value BOT (Beginning of 
Tuple). If a program cannot generate a VECTORS header item before 
generating all data, it can use the Special Data Value BOT to 
determine the number of vectors in the file by counting the 
number of Data Values between BOTs when it rereads the file. A 
program can also verify its position in a file by using the BOT 
Special Data Value. 
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The Special Data Value EOD (End of Data) indicates the end of 
data in the file. The EOD occurs at the end of the last tuple in 
the file. If the program is unable to generate a TUPLES header 
item before generating all data, it can determine the number of 
tuples by counting the number of BOTs before the EOD when it 
rereads the file. A program can also use the EOD Special Data 
Value to detect the end of the file. 


5.3.2 Numeric Value and Value Indicator 

If the Type Indicator is 0, the data is numeric, and the String 
Value is one of the Value Indicators described below. The Value 
Indicator Overrides the value. 

A Subsystem may define Value Indicators for its own needs. New 
Value Indicators should be registered with the DIF Clearinghouse. 

The Value Indicators currently defined ai:e: 

V Value - This is the String Value most commonly used with 

a numeric value. The Number Value contains the actual 
value. 

NA Not Available - The value is marked as not available. The 

Number Value is 0. 

ERROR The value represents the result of an invalid 
calculation. The Number Value is 0. 

TRÜE Logical value. The Number Value is 1. 

FALSE Logical value. The Number Value is 0. 

The String Value can be ignored in favor of the Number Value, or 
all values with a Value Indicator other than V can be considered 
nonexistent. Quotes are not permitted around the Value Indicator. 


5.3.3 String Value 

If the Type Indicator is 1, the String Value is the actual 
character string. If the value is a token, the quotation marks 
are optional. However, if there is a beginning quotation mark, 
there must be a terminating quotation mark. 


6. Definitions 


This section defines specific characteristics of DIF. 
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Character Sets This document assumes use of the ASCII character 
set. The following characters are permitted: 

!"(#$%&"()*+,-./ 

0123456789s ?< = >? 

0ABCDEFGHIJKLMNO 
PQRSTUVWXYZ[\] 

'abcdefghij klmno 
pgrstuvwxyz{| 

(The first character in this list is a space.) 

There are 95 printable characters, including the 
space. If the host Computer has more than 95 
characters, the additional characters must be 
mapped into the 95 ASCII characters to transfer 
data to another machine. 

Some Computers permit only 64 characters. When 
data is transfered to these machines, lower case 
characters and the characters are mapped 

into their corresonding upper case characters. If 
these transformations affect the integrity of the 
data, associated documentation should specify the 
effect. 

Transfers between character sets should be 
transparent to most users. To assure 
compatibility, strings should not contain 
nonprinting characters. 

EBCDIC EBCDIC is a binary representation of characters 

and is used primärily for large IBM Computers. An 
awareness of the representation used is not 
essential, but if files are transferred between 
machines they must be converted to the Standard 
representation of the host machine. 

Because EBCDIC defines more than the 95 Standard 
printable characters, users should avoid the 
additional characters when preparing data files 
on an EBCDIC machine. 

String Length Some programs place a length limit on strings 
that they read. This results in the truncation of 
long string values. Some Systems also limit the 
length of lines in a data file. Programs should 
support a minimal string length of 64 characters, 
but longer ones are preferable. 

String Delimiters 

Some Systems delimit strings with apostrophes 
instead of quotation marks. When files are 
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transferred to or from these Systems, appropriate 
changes must be made. 

Tokens A token is a String consisting of upper case 

alphanumeric characters. It should have a maximum 
length of 32 characters. Commonly, tokens may or 
may not be contained within quotation marks? 
however, a token that is a required string, such 
as a header item topic, must be represented 
without quotation marks. 

Floating Point Numbers 

A floating point number consists of an optional 
sign and a series of digits followed by an 
optional decimal point» The number may be 
followed by the letter E (exponent) and a signed 
decimal exponent. 

Note: Some Systems generate the letter D to 

indicate a double precision floating point 
number. This is not Standard, but it can be read 
by compatible programs within a single system. 
When transfering data to other Computers, the D 
must be converted to an E. 


7. Applications Programs 


This section records the specific use of DIF by applications 
programs that support it. Programmers who intend to interface 
with any of these programs should note the specifics listed here. 
Standardized optional items used by these applications are listed 
in the general section on Optional Header Items. However, if a 
program uses a header item that varies significantly from the 
conventions, it is mentioned only in this section. The accuracy 
of this information is not guaranteed. 


7.1 The CCA/DMS Program 

Published and distributed by VisiCorp, Inc. 
Uses: SIZE 


7.2 The TREND-SPÖTTER(R) Program 

Published and distributed by Software Resources, inc. 

The TREND”SPÖTTER program requires that the DIF file contain 
either only one tuple or only one vector. 
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7.3 The VisiCalc(R) Prograin 

Published and distributed by VisiCorp, Inc. 

Program created and written by Software Arts, Inc. 

The VisiCalc program does not generate the LABEL items. Some 
programs interfacing to the VisiCalc program have adopted the 
convention of examining the first Data Value in a tuple, and, if 
it is a string value, treating it as a label. 


7.4 TKISolver 

Published and distributed by Software Arts, Inc. 
Uses: UNITS, DISPLAYUNITS 


7.5 the VisiPlot and VisiTrend/VisiPlot Programs 
Published and distributed by VisiCorp, Inc. 

Early versions of the VisiPlot and VisiTrend/VisiPlot programs 
used the Number Value and String Value incorrectly, storing the 
Number Value in the String Value field. Programs exchanging data 
with these versions should check the String Value. If it is not 
null, the string must be converted and the Number Value computed. 

Uses: LABEL, COMMENT, MAJORSTART, MINORSTART, PERIODICITY, 

TRUELENGTH 

DIF is a trademark of Software Arts, Inc. 

TREND-SPOTTER is a registered trademark of Friend Information 
Systems. 

VisiCalc is a registered trademark of VisiCorp Inc. 

VisiPlot is a trademark of VisiCorp Inc. 

VisiTrend/VisiPlot is a trademark of VisiCorp Inc. 

TKISolver is a trademark of Software Arts, Inc. 
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I. Sample DIF File 


This is an example of a DIF data file. The data in the file is 
represented by the table below. 


PROFIT REPORT 


YEAR 

|SALES 

| COST 

[PROFIT 

1980 

100 

I 90 

10 

1981 

110 

101 

9 

1982 

121 

I 110 

11 


The Data File 


TABLE > 

0,1 > 

"PROFIT REPORT" > 

VECTORS -> Header > 

0,4 > > 

"" > Item > 

TUPLES > 

0,3 > 

II II > 

LABEL > 

1,0 > 

"YEAR" > Header 

LABEL > 

2,0 > 

"SALES" > 

LABEL > 

3,0 > 

"COST" > 

LABEL > 

4,0 > 

"PROFIT" > 

DATA > 

0,0 > 
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—1,0 -> 

BOT > 

0,1980 > 

V > 

0,100 > 

V > 

0,90 > 

V > 

0,10 > 

V > 

- 1,0 > 

BOT > 

0,1981 > > 

V > > Data 

0,110 > > 

V > > Part 

0,101--> Data > Tuple > 

V -> Value > > 

0,9 > > 

V > > 

- 1,0 > > 

BOT > > 

0,1982 > 

V > 

0,121 > 

V > 

0,110 > 

V > 

0,11 > 

V > 

- 1,0 > 

EOD > 
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II. Sample BASIC program that writes a DIF file 


This program enters Student records into a file by prompting the 
user for a Student's name and test scores and copying the 
Information into a DIF file. 


100 REM - 
110 REM - 
120 REM - 
130 REM - 
140 REM - 
150 REM - 
160 REM - 


THIS PROGRAM CREATES A DIF FILE CONTAINING THE 
NAME AND TEST SCORES OF A GIVEN NUMBER OF STUDE 
IT PROMPTS FOR A FILE NAME,, THE TOTAL NUMBER OF 
STUDENTS, AND THE NUMBER OF TEST SCORES FOR 
EACH STUDENT. IT THEN PROMPTS FOR A STUDENT'S 
NAME AND TEST SCORES, AND WRITES THEM TO THE 
FILE AS A TUPLE. 


1000 

PRINT "OUTPUT FILE NAME:"; 

: REM 

- 

GET FILE NAME. 

1010 

1020 

INPUT F$ 

OPEN "0" ,1,F $ 

: REM 


OPEN FILE FOR OUTP 

1030 

1035 

PRINT "NUMBER OF STUDENTS: 

11 • 

:REM 


PROMPT FOR NUMBER 

1040 

INPUT NT 

: REM 

- 

TUPLES. 

1050 

PRINT "NUMBER OF TEST SCORES PER STUDENTs"; 

1060 

INPUT NV 

: REM 

- 

NUMBER OF VECTORS 

1070 

NV = NV + 1 

: REM 

- 

NUMBER OF SCORE 

1080 

GOSUB 3000 

: REM 

- 

USE SUBROUTINE TO 

1090 


: REM 

- 

OUTPUT DIF HEAD 

2000 

FOR I = 1 TO NT 

: REM 

- 

OUTPUT A TUPLE FOR 

2010 


: REM 

- 

EACH STUDENT. 

2020 

2025 

T = -1: V = 0: S$ = "BOT" 

: REM 

_ 

OUTPUT BOT SPECIAL 

2030 

GOSUB 4000 

: REM 

- 

DATA VALUE. 

2040 

2050 

PRINT "NAME OF STUDENT 
INPUT S$ 

#" ?i; 

: REM 

_ 

GET NAME OF THIS S 

2060 

T = 1: V = 0 

: REM 

- 

OUTPUT AS STRING D 

2070 

GOSUB 4000 

: REM 

- 

VALUE. 

2080 

FOR J = 1 TO NV-1 

: REM 

- 

PROCESS EACH SCORE 

2090 

2100 

PRINT "SCORE #";J; 
INPUT V 

: REM 

_ 

GET SCORE. 

2110 

T = 0: S$ = "V" 

: REM 

- 

OUTPUT SCORE AS A 

2120 

GOSUB 4000 

: REM 

- 

VALUE. 

2130 

2140 

2150 

NEXT J 

NEXT I 

T = -1: V = 0: S$ = "EOD" 

: REM 


OUTPUT EOD SPECIAL 

2160 

GOSUB 4000 

: REM 

- 

VALUE. 

2170 

CLOSE 1 

:REM 

- 

CLOSE THE OUTPUT F 

2180 

STOP 

: REM 

- 

DONE. 
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3000 :REM - ROUTINE TO OUTPUT 

3010 PRINT#!,"TABLE":PRINT#1,"0,1"sGOSUB 3500 
30 20 PRINT#1, "TUPLES" : PRINT# 1, 11 0 , " ; NT sGOSUB 35 00 
3030 PRINT#1,"VECTORS" : PRINT#!," 0, " ?NVsGOSUB 3500 
3040 PRINT#!," DATA" : PRINT#!, " 0,0" sGOSUB 3500 


3050 RETURN 


3500 


: REM 

- ROUTINE 

TO OUTPUT 

3510 


;REM 

NULL 

STRING ("" 

3520 

3530 

PRINT#1,CHR$(34);CHR$(34) 
RETURN 

: REM 

- PRINT 2 

QUOTATION 


4000 

:REM - 

ROUTINE TO OUTPUT 

4010 

s REM - 

VALUE 0 T IS TH 

4020 

s REM - 

INDICATOR, V IS 

4030 

ö REM - 

NUMBER VALUE, A 

4040 

s REM - 

IS THE STRING V 


4050 PRINT#1,T;",";V 
4060 PRINT#!,S$ 

4 07 0 RETURN 
5000 END 
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III. Sample BASIC prograra that reads a DIF file 


This program uses the output DIF file from the sample program in 
Appendix B to calculate an average score and letter grade for 
each Student. 


100 : REM - THIS PROGRAM READS A DIF FILE CONTAINING THE 

110 :REM - TEST SCORES OF A GROUP OF STUDENTS, CALCULAT 

120 :REM - AN AVERAGE SCORE FOR EACH STUDENT, MATCHES T 

130 :REM - AVERAGE TO A LETTER GRADE, AND PRINTS THE 

140 :REM - STUDENT*S NAME, AVERAGE, AND LETTER GRADE. 


500 
510 
520 
530 
5 35 
540 
550 
560 
570 
575 
580 
590 
600 
610 
620 
630 
640 
650 
660 
670 
680 
690 
700 
710 
720 


DIM T(100) 
DIM V(100) 
DIM V$ (100) 


GOSUB 5000 
GOSUB 6000 
FOR I = 1 TO NT 
GOSUB 7000 


sREM - MAXIMUM OF 100 VECTORS. 
:REM -TIS THE TYPE INDICATOR, V 
:REM - THE NUMBER VALUE, AND V 

:REM - THE STRING VALUE OF EAC 

:REM - VALUE. 

sREM - INITIALIZATION SUBROUTINE. 

:REM - SUBROUTINE TO READ HEADER. 

:REM - FOR EACH TUPLE, 

:REM - GET ALL VECTOR ELEMENTS 

:REM - TUPLE. 

M=0 :REM - M IS THE SUM OF THE SCO 

FOR J = 1 TO NV :REM - FOR EACH VECTOR VALUE, 

IF T(J)=1 THEN PRINT V$(J) :REM - PRINT NAME. 

IF T(J)=0 THEN M = M+V(J) :REM - ADD SCORES. 

NEXT J 

M = M/(NV-1) : PRINT M :REM - PRINT STUDENT 1 S AVERAG 
IF M<=50 THEN PRINT "THIS STUDENT*S FINAL GRADE IS 
IF M< = 70 AND M>50 THEN PRINT "THIS STUDENT*S FINAL 

IF M< = 85 AND M>70 THEN PRINT "THIS STUDENT*S FINAL 

IF M< = 94 AND M>85 THEN PRINT "THIS STUDENT *S FINAL 

IF M>94 THEN PRINT "THIS STUDENT*S FINAL GRADE IS A 

NEXT I 
CLOSE 2 

PRINT "FINISHED CALCULATING GRADES" 

STOP 


5000 

5010 

PRINT "FILE NAME"? 

: REM 

5020 

INPUT F $ 


5030 

OPEN "I",2,F$ 

: REM 

5040 

NV = 0 

:REM 

5050 

NT = 0 

: REM 

5060 

RETURN 



INITIALIZATION CODE. 


OPEN FILE FOR INPUT. 
INITIAL VECTOR COUNT. 
INITIAL TUPLE COUNT. 
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6000 

6010 

6020 

6030 

6040 

6050 

6060 

6065 

6070 

6500 

6510 

6520 

6530 

6540 

6600 

6610 


:REM - READ HEADER. GET NUMBER OF VECTORS AND TUPL 
INPUT#2, T$ :REM - GET TOPIC. 

INPUT#2,S ,N :REM - GET VECTOR NUMBER AND VALU 

INPUT#2,S$ :REM - GET STRING VALUE. 

IF T$="VECTORS" THEN 6500:REM - CHECK FOR KNOWN HEAD 
IF T$="TUPLES" THEN 6600 :REM - ITEMS. 

IF T$="DATA n THEN RETURN 

:REM - "DATA" ENDS HEADER. 

GOTO 6010 :REM - IGNORE UNKNOWN ITEMS. 

NV = N :REM - NUMBER OF VECTORS. 

IF NV< = 10 0 THEN 6010 :REM - CHECK FOR 10 0 OR LESS VE 
PRINT "TOO MANY VECTORS. PROGRAM CAPACITY 100 VECTOR 
CLOSE 2 
STOP 

NT = N :REM - NUMBER OF TUPLES. 

GOTO 6010 :REM - GET NEXT HEADER ITEM. 


7000 :REM - SUBROUTINE TO GET ALL VECTOR ELEMENTS IN A 

7010 GOSUB 8000 :REM - GET NEXT DATA VALUE. 

7020 IF Tlo-l THEN 9000 : REM ~ MUST BE BOT, ELSE ERROR 


7030 

IF S$< > 11 BOT" THEN. 

9000 



7040 

FOR K = 1 TO NV 

:REM 

- GET EACH DATA VALUE 

7050 

GOSUB 8000 




7060 

IF T1>1 THEN 

9000 



7070 

T(K) = Tl 

: REM 

- SAVE 

TYPE INDICATOR 

7080 

V(K) = VI 

:REM 

- SAVE 

NUMBER VALUE. 

7090 

V$(K) = S$ 

: REM 

- SAVE 

STRING VALUE. 

7100 

NEXT K 




7110 

RETURN 





8000 :REM - SUBROUTINE TO GET NEXT DATA VALUE. 

8010 INPUT#2,TI,VI :REM - GET TYPE INDICATOR, NUMERIC 

8020 INPUT#2,S$ :REM - VALUE, AND STRING VALUE. 

8030 RETURN 

9000 :REM - ERROR ROUTINE 

9010 PRINT "ERROR IN FILE FORMAT" 

9020 CLOSE 2 :REM - END PROGRAM 

9030 STOP 
9040 END 
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IV. Sample Pascal program using a DIF file 


This program is a Pascal program that reads data from a DIF file 
into an array and displays the results on the terminal« 

{ This is a simple program which reads DIF file data into an arra 
displays the results on the terminal. It makes use of a proced 
called "get_dif_array” which handles only numeric data« It is 
for Apple Pascal 1.1 and may require modification to run on oth 

program dif_read; 

const 

max_vector = 10; 

max_tuple = 10; 

type 

vector^index = 0. .max_vector ; 
tuple_index = 0 . . max_tuple ; 

dif_array = array[1.«max_vector, 1..max_tuple] of real; 

var 

: vector_index; 

: tuple_index; 

; integer; 


in_file 
f na me 
matrix 


text; 

string[15]; 
dif array; 


num_vector s 
num_tuples 
code, i, j 


{ maximum number of 
{ maximum number of 
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{ "Get_dif_array n reads a DIF file and ceturns the file data (cur 
only numeric) in an array. Also returns number of vectors and 
— these must be specified in file header — and an error code. 


procedure get_dif_array (var dif_file: text; var real_array: dif_ 

var nvectors: vector_index; var ntupless 
var return_code: integer); 


const 

special = -1; numeric 


{ currently defined data types } 
0; char_string = ly other = 


type 

header item 


data value 


record 

topic 

vec tor_num 
value 

String_value 
end ; 
record 
kind 

number_value 
string_value 
end; 


string; 
vector_index; 
integer; 
string 


-1..2; { currently defined 

real; 
string 


var 

hdr_item : header_item; 

data_val : data_value; 

tuple, vector : integer; 


{ "Read_integer" reads an integer terminated with a comma. Th 
is required because this Pascal dialect's "read" procedure r 
only <space>, eoln and eof as delimiters of integer values, 

procedure read_integer (var number: integer); 
var 

sign, magnitude : integer; 
ch : char; 
begin 

sign := 1; magnitude := 0; j initialize } 

read (difile, ch) ; { get Ist chara 

while ch <> 1 , 1 do { comma is deli 

begin 

case ch of 

' - 1 : s ig n : = -1; 

: magnitude := magnitude * 10 + ord(ch) - or 
end; { case } 

read (difile, ch) { get next char 

end; 

number := sign * magnitude { return result 

end; { read_integer } 
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{ "Read_string" deletes leading and trailing 
quotes from quoted strings. } 

procedure read_string (var str: string); 
beg in 

readln (difile, str) ; 
while str [1] = 1 1 do 
delete (str, 1, 1); 
if str [1] = ' 

then begin 

dele te (str, 1, 1) ; 
delete (str, pos('" 1 , str), 

length(str) - pos ('" 1 , 

end 

eise if pos(' ', str) > 0 

then delete (str, pos(’ ', str), 

length(str) - pos (* 

end? { read_string } 


blanks and strip 

{ leading blank 
{ strip quotes 

str) + 1) 

{ trailing bla 
' . str) + 1) 
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procedure read_header_item (var item; header_item) ; 
begin 

read_string (item.topic); { get topic } 

read_integer (item.vector _num) ; { get vector n 

readln (difile, item.value) ; \ get value } 

read string (item.stringvalue) 1 get string v 

end; ~[ read_header_item } 

procedure read_data_value (var value; data_value); 
beg in 

read_integer (value. kind) { get data typ 

readln (difile, value.number_value); \ get number v 

read string (value.string_value) { get string v 

end; X read_data_value } 

begin { get_dif_array } 
return_code ;= 0; 
nvectors ;= 0; ntuples := 0; 
repeat 

read_header_item (hdr_item); 
if hdr_item.topic = ’VECTORS 8 

then nvectors ;= hdr_item.value 
eise if hdr_item.topic = 'TUPLES 1 
then ntuples := hdr_item.value 
until hdr_item.topic = 'DATA'; 

if (nvectors = 0) or (ntuples = 0) ( check counts 

then return_code := 1 
eise begin 

for tuple ;= 1 to ntuples do { read data j 

begin 

read_data_value (data_val); ( BOT \ 

for vector ;= 1 to nvectors do 
beg in 

read_data_value (data_val); 
if data_val.kind = numeric 

then real_array[vector, tuple] := 

data_val.number_value 

end 

end; 

read_data_value (data_val); { EOD } 

if (data_val.kind <> special) or 

(data_val.string_value <> ’E0D s ) 
then return_code := 2 

end 

end; { get_dif_array } 


{ assume no pr 
{ initialize } 
{ read header 


t vector count 
{ tuple count 
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begin { dif_read } 

( get DIF file name } 

{ open and point to BOF } 


writeln; 

write ('DIF file name: '); 

readln (fname); 

reset (in_file, fname); 


get__dif_ar ray (in_file, matrix, num_vectors, num_tuples, code) 

close (in_file); { close DIF file } 

case code of { display results } 

0: begin 

wr i te ln; 

wr iteln (* " 1 , fname: 15, ' ' contains num_vectors: 

1 vectors and num_tuples:3, ' tuples.'); 

writeln ('The data values follow in tuple Order:'); 
writeln; 

for i := 1 to num_tuples do 
begin 

for j := 1 to num_vectors do 
write (matrix[j, ij:10:2); 
writeln 
end; 
writeln 
end; 

1: writeln ('Error. Tuple or vector count not found.'); 

2: writeln ('Error. Data not proper ly terminated.') 
end { case } 

end. { dif_read } 
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